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Abstract 



While the asymptotic relative efficiency (ARE) of Wilcoxon rank-based tests for 
location and regression with respect to their parametric Student competitors can 
be arbitrarily large, Hodges and Lehmann (1961) have shown that the ARE of 
,-h the same Wilcoxon tests with respect to their van der Waerden or normal-score 

counterparts is bounded from above by 6/n ~ 1.910. In this paper, we revisit that 
result, and investigate similar bounds for statistics based on Student scores. We 
also consider the serial version of this ARE. More precisely, we study the ARE, 

,-j- ■ under various densities, of the Spearman-Wald-Wolfowitz and Kendall rank-based 

autocorrelations with respect to the van der Waerden or normal-score ones used 

q? . to test (ARM A) serial dependence alternatives. 

Keywords: Asymptotic relative efficiency, rank-based tests, Wilcoxon test, van 
,_+ ■ der Waerden test, Spearman autocorrelations, Kendall autocorrelations, linear 

O ■ serial rank statistics 



1. Introduction 

The Pitman asymptotic relative efficiency AREj(0 1 /0 2 ) under density / of a 
test <pi with respect to a test 2 is defined as the limit (when it exists) as n\ tends 
to infinity of the ratio n 2; /(n 1 )/n 1 of the number n 2 -j(n 1 ) of observations it takes 
for the test 2 , under density /, to match the local performance of the test <pi based 
on ri]_ observations. That concept was first proposed by Pitman in the unpublished 
lecture notes [28] he prepared for a 1948-49 course at Columbia University. The 
first published rigorous treatment of the subject was by Noether [25] in 1955. A 
similar definition applies to point estimation; see, for instance, [6] for a more 
precise definition. An in-depth treatment of the concept can be found in Chap- 
ter 10 of Serfling [31], Chapter 14 of van der Vaart [32], or in the monograph by 
Nikitin [24]. 
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The study of the AREs of rank tests and R-estimators with respect to each 
other or with respect to their classical Gaussian counterparts has produced a num- 
ber of interesting and sometimes surprising results. Considering the van der Waer- 
den or normal-score two-sample location rank test v aw an d its classical normal- 
theory competitor, the two-sample Student test <f)j^, Chernoff and Savage in 1958 
established the rather striking fact that, under any density / satisfying very mild 
regularity assumptions, 

ARE/^vdw/^V) > 1, (1.1) 

with equality holding at the Gaussian density f = (ft only. That result implies that 
rank tests based on Gaussian scores (that is, the two-sample rank-based tests for 
location, but also the one-sample signed-rank ones, traditionally associated with 
the names of van der Waerden, Fraser, Fisher, Yates, Terry and/or Hoeffding — 
for simplicity, in the sequel, we uniformly call them van der Waerden tests — 
asymptotically outperform the corresponding everyday practice Student t-test; 
see [1]. That result readily extends to one-sample symmetric and m-sample loca- 
tion, regression and analysis of variance models with independent noise. 

Another celebrated bound is the one obtained in 1 956 by Hodges and Lehmann, 
who proved that, denoting by 0w the Wilcoxon test (same location and regression 
problems as above), 

ARE / (0 w /0 Ar ) > 0.864, (1.2) 

which implies that the price to be paid for using rank- or signed-rank tests of 
the Wilcoxon type (that is, logistic-score-based rank tests) instead of the tradi- 
tional Student ones never exceeds 13.6% of the total number of observations. That 
bound moreover is sharp, being reached under the Epanechnikov density /. On 
the other hand, the benefits of considering Wilcoxon rather than Student can be 
arbitrarily large, as it is easily shown that the supremum over / of AREj(0 w /0^) 
is infinite; see [20]. 

Both (1.1) and (1.2) created quite a surprise in the statistical community of 
the late fifties, and helped dispelling the wrong idea, by then quite widespread, 
that rank-based methods, although convenient and robust, could not be expected 
to compete with the efficiency of traditional parametric procedures. 

Chernoff- Savage and Hodges-Lehmann inequalities since then have been ex- 
tended to a variety of more general settings. In the elliptical context, optimal 
rank-based procedures for location (one and m-sample case), regression, and scat- 
ter (one and m-sample cases) have been constructed in a series of papers by Hallin 
and Paindaveine ([7], [11], and [13]), based on a multivariate concept of signed 
ranks. The Gaussian competitors here are of the Hotelling, Fisher, or Lagrange 



multiplier forms. For all those tests, Chernoff-Savage result similar to (1.1) have 
been established (see also [26, 27]). Hodges-Lehmann results also have been 
obtained, with bounds that, quite interestingly, depend on the dimension of the 
observation space: see [7]. 

Another type of extension is into the direction of time series and linear rank 
statistics of the serial type. Hallin [5] extended Chemoff and Savage's result (1.1) 
to the serial context by showing that the serial van der Waerden rank tests also 
uniformly dominate their Gaussian competitors (of the correlogram-based port- 
manteau, Durbin- Watson or Lagrange multiplier forms). Similarly, Hallin and 
Tribel [19] proved that the 0.864 upper bound in (1.2) no longer holds for the 
AREs of the Wilcoxon serial rank test with respect to their Gaussian competitors, 
and is to be replaced by a slightly lower 0.854 one. Elliptical versions of those 
results are derived in Hallin and Paindaveine ([8], [9], [10]). 

Now, AREs with respect to Gaussian procedures such as i-tests are not always 
the best evaluations of the asymptotic performances of rank-based tests. Their 
existence indeed requires the Gaussian procedures to be valid under the density / 
under consideration, a condition which places restrictions on / that may not be 
satisfied. When the Gaussian tests are no longer valid, one rather may like to 
consider AREs of the form 

AgEffalfa) = l/ARE f (fa/fa) (1.3) 

comparing the asymptotic performances (under /) of two rank-based tests 0j 
and <p K , based on score-generating functions J and K, respectively. Being distri- 
bution-free, rank-based procedures indeed do not impose any validity conditions 
on /, so that ARE /■((/> j/0/f) in general exists under much milder requirements 
on /; see, for instance, [17] and [18], where AREs of the form (1.3) are provided 
for rank-based methods in linear models with stable errors under which Student 
tests are not valid. 

Obtaining bounds for AREj ((/>j/(J>k), m general, is not as easy as for AREs of 
the form AREj((pj/(f>j^). The first result of that type was established in 1961 by 
Hodges and Lehmann, who in [21] show that 

< ARE / (0 w /0vdw) < 6/tt ps 1.910 (1.4) 

or, equivalently, 

0.524 « tt/6 < ARE / (0 vdW /0w) < oo (1.5) 



for all / in some class T of density functions satisfying weak differentiability 
conditions. Hodges and Lehmann moreover exhibit a parametric family of den- 
sities J^l = {fa | a E [0, oo)} for which the function a !->■ ARE/ Q (0 w /</> vdW ) 
achieves any value in the open interval (0, Q/n) (a i-> ARE/ Q (0 v dw/0w) achieves 
any value in the open interval (7r/6, oo)). The lower and upper bounds in (1.4) 
and (1.5) thus are sharp in the sense that they are the best possible ones. The 
same result was extended and generalized by Gastwirth [2]. 

Note that, in case / has finite second-order moments (so that ARE/(0w/0Af) 
is well defined), since ARE/(0 vdW /0^) = ARE/(0 vdW /0 w ) x ARE/(0 w /0 A r), 
Hodges and Lehmann's "6/7r result" implies that the ARE of the van der Waerden 
tests with respect to the Student ones, which by the Chernoff -Savage inequality is 
larger than or equal to one, actually can be arbitrarily large, and that this happens 
for the same types of densities as for the Wilcoxon tests. This is an indication that, 
when Wilcoxon is quite significantly outperforming Student, that performance is 
shared by a broad class of rank-based tests and J?-estimators, which includes the 
van der Waerden ones. 

In Section 2, we successively consider the traditional case of nonserial rank 
statistics used in the context of location and regression models with independent 
observations, and the case of serial rank statistics; the latter involve ranks at time t 
and t — k, say, and aim at detecting serial dependence among the observations. 
Serial rank statistics typically involve two score functions and, instead of (1.3), 
yield AREs of the form 

ARE}(0 JliJ2 /0 j3 , j4 ). (1.6) 

To start with, in Section 2.1, we revisit Gastwirth's classical nonserial results. 
More precisely, we provide (Proposition 2.2) a slightly different proof of the main 
proposition in [2], with some further illustrations in the case of Student scores. 
In Section 2.2, we turn to the serial case, with special attention for the so-called 
Wilcoxon-Wald-Wolfowitz, Kendall and van der Waerden rank autocorrelation 
coefficients. Serial AREs of the form (1.6) typically are the product of two factors 
to which the nonserial techniques of Section 2.1 separately apply; this provides 
bounds which, however, are not sharp. Therefore, in Section 3, we restrict to a 
few parametric families — the Student family (indexed by the degrees of freedom), 
the power-exponential family, or the Hodges-Lehmann family .Fhl — for which 
numerical values are displayed. 



2. Asymptotic relative efficiencies of rank-based procedures 

The asymptotic behavior of rank-based test statistics under local alternatives, 
since Hajek and Sidak [4], is obtained via an application of Le Cam's Third 
Lemma (see, for instance, Chapter 13 of [32]). Whether the statistic is of the 
serial or the nonserial type, the result, under a density / with distribution func- 
tion F involves integrals of the form 



£(«/):=/ J 2 (u)du, /C(J,/):=/ J(u)(p f (F-\u))du, 



and, in the serial case, 



J(J,f):= / J(u)F'\u)du 



where, assuming that / admits a weak derivative /', iff := —f'/f is such that 
the Fisher information for location 1(f) : = f Lp 2 AF~ l (u))du is finite. Denote 
by T the class of such densities. If local alternatives, in the serial case, are of the 
ARMA type, / is further restricted to the subset T2 of densities / E T having 
finite second-order moments. Differentiability in quadratic mean of f 1 ! 2 is the 
standard assumption here, see Chapter 7 of [32]; but absolute continuity of / in 
the traditional sense, with a.e. derivative /', is sufficient for most purposes. We 
refer to [4] and [16] for details in the nonserial and the serial case, respectively. 

2.1. The nonserial case 

In location or regression problems, or, more generally, when testing linear con- 
straints on the parameters of a linear model (this includes ANOVA etc.), the ARE, 
under density / e J, of a rank-based test <\>j x based on the square-summable 
score-generating function J\ with respect to another rank-based test 0j 2 based on 
the square-summable score- generating function J 2 takes the form 



Wf' (Ji ' h) ' wi,h c ' {J ^ ):= mry 



ASE t ^ Jl /^j 2 ) = '^{C 2 f (J h J 2 ), xvith C,{J U J,):= "),,[ , (2.1) 



provided that J\ and J 2 are monotone, or the difference between two monotone 
functions. Those ARE values readily extend to the m-sample setting, and to R- 
estimation problems. In a time-series context with innovation density / e T%, 
and under slightly more restrictive assumptions on the scores, they also extend to 



the partly rank-based tests and R-estimators considered by Koul and Saleh in [22] 
and [23]. 

Gastwirth (1970) is basing his analysis of (2.1) on an integration by parts of 
the integral in the definition of /C( J, /). If both Ji and J 2 are differentiable, with 
derivatives J[ and J' 2 , respectively, and provided that / is such that 

lim Ji(F(x))/(x) = = lim J 2 {F(x))f{x), 

x~>oo x— >-oo 

integration by parts in those integrals yields, for (2.1), 

/C(J 2 ) ( r J[{F{x))f 2 {x)dx\ 2 

^ w= Js[ r:S )' <2 ' 2) 

In view of the Chernoff-Savage result (1.1), the van der Waerden score-genera- 
ting function 

Mu) = Jvdw(w) = $-\u) (2.3) 

(with u i—T- $ -1 (u) the standard normal quantile function) may appear as a natural 
benchmark for ARE computations. From a technical point of view, under this 
integration by parts approach, the Wilcoxon score-generating function 

J*(u) = J w (u) = u-l/2 (2.4) 

(the Spearman-Wald-Wolfowitz score-generating function in the serial case) is 
more appropriate, though. Convexity arguments indeed will play an important 
role, and, being linear, J w is both convex and concave. Since J w (u) = 1 
and /C( J\y) = 1/12, equation (2.2) yields 

1 ( i 00 J[(F(x))f 2 (x)dx\ 2 
12 ARE, ( MM = — [ ] -^;^ > ) ■ («) 

Bounds on J[(F(x)) then readily yield bounds on AREs, irrespective of /. 

That property of Wilcoxon scores is exploited in Propositions 2.2 and 2.3 for 
non-serial AREs, in Propositions 2.4 for the serial ones; those bounds are mainly 
about AREs of, or with respect to, Wilcoxon (Spearman-Wald-Wolfowitz) proce- 
dures, but not exclusively so. 

Assume that / 6 F := {/ G F\ \\m. x ^.± 00 f(x) = 0}. Then, integration by 
parts is possible in the definition of /C( Jw, /), yielding 

POO 

£(J w ,/)= / f(x)dx. 



Assume furthermore that the square-integrable score-generating function J\ (the 
difference of two monotone increasing functions) is differentiable, with deriva- 
tive J[, and that 

/ G T Jx := {/ G Jol lim J x {F{x))f{x) = 0}, 

X— >±oo 

so that (2.2) holds. Finally, assume that J\ is skew- symmetric about 1/2. Defining 
the (possibly infinite) constants 



H 



J 



sup |J'(^)| and Kj := inf \J'{u) 



u>l/2 u>\/2 

we can always write 

12 ARE/ ($jj<for) < (k+) 2 //C(J 1 ) (2.6) 

while, if Ji is non-decreasing (hence J[ is non-negative), we further have 

(/^jV/qJx) < 12 ARE / {hJM < (kX) 2 /K:(Ji). (2.7) 

The quantities appearing in (2.6) and (2.7) often can be computed explicitly, yield- 
ing ARE bounds which are, moreover, sharp under certain conditions (see below). 
For example, if J\ is convex on [1/2, 1), its derivative J[ is non-decreasing 
over [1/2, 1), so that 

«- = J[(l/2) > and k\ = lim J[(u) < +oo. (2.8) 

It follows that, under the assumptions made, 

(J{(l/2)) 2 //C(J!) < 12 ARE/ (^/^w) < (lim J[(u)) 2 / IC^) . (2.9) 

The lower bound in (2.9) is established in Theorem 2.1 of [2]. 

The double inequality (2.9) holds, for instance (still, under / G J 7 ^), when the 
scores J\ — cp g o G~ l are the optimal scores associated with some symmetric and 
strongly unimodal density g with distribution function G; such densities indeed are 
log-concave and have monotone increasing, convex over [1/2, 1) score functions. 
Symmetric log-concave densities take the form 



/oo 
e~^ x) dx (2.10) 

■oo 



with x H- jjl{x) a convex, even (that is, /i(x) = /i(— x)) function; assume it to be 
twice differentiable, with derivatives /i' and ji" . Then, ip g (x) = //(x), so that 

/oo 
(fi'(x)) 2 g(x)dx = l(g) 
-oo 

where 1(g) the Fisher information of g (which we assume to be finite), and 
J[(u) = v"(G-\u))/g(G-\u)), hence J[(l/2) = ^y = ^- 

Specializing (2.9) to this situation, we obtain the following proposition. 

Proposition 2.1. If the square-integrable score-generating function J\ is of the 
form (f g o G~ x with g given by (2.10), /i even, convex, and twice differentiable, 
then, under any f G Tj v 

< 12X(g)ARE f (4 >Jl /^ w ) < (lim J[{u)) 2 = ( lim (//'(*)/ 'g{x)) 2 . 



A / u—ti 

(2.11) 

With /i(x) = x 2 /2 (so that K' 1 = \/2tt) in (2.10), g is the standard Gaus- 
sian density; yu"(0) = 1, 1(g) = 1, and the lower bound in (2.11 ) becomes 
(/j,"(0)/K) 2 = 2n, whereas the upper bound is trivially infinite. This yields the 
Hodges-Lehmann result (1.4). 

Turning back to (2.6) and (2.7), but with J x concave (and still non-decreasing) 
on [1/2, 1), J[ is nonincreasing, so that Kj = J{(l/2) and 

12 ARE 7 (<PjJM < (J((l/2)) 2 //C(Ji). (2.12) 

Not much can be said on the lower bound, though, without further assumptions 
on the behavior of Ji around u = 1. 

Replacing, for various score-generating functions J x and densities /, the quan- 
tities appearing in (2.6), (2.9) or (2.12) with their explicit values provides a variety 
of bounds of the Hodges-Lehmann type. Below, we consider the van der Waer- 
den tests </> v dw> based on the score-generating function (2.3) and the Cauchy-score 
rank tests 0cauch y > based on the score-generating function 

</cauch» = sin(27r(n - 1/2)). (2.13) 



Wil/VdW 



Cau/VdW 





Figure 1: AREj(0w/</ ) vdw) an d AREy-(0cauchy/0vdw) under various families of den- 
sities: symmetric stable (indexed by their tail parameter a), Student-t (indexed by their 
degrees of freedom v) or Pareto (indexed by their shape parameter a). 



Proposition 2.2. For all symmetric densities f in Jvaw. -^Cauchy and ^vaw D ^~Cauchy> 
respectively, 

(i) ARE f (4> w /(j) vdW ) < 6/tt; 
(ii) AREfifc^y/M < 2vr 2 /3; 

(Hi) ARE /(0Cauchy/0vdw) < 4vr. 

Proof. The van der Waerden score (2.3) is strictly increasing, and convex over 
[1/2, 1). One readily obtains 

/C(J vdW ) = 1 and j; dW (n) = v / 2Texp{($- 1 (n)) 2 /2}, 



hence ft^dw = <-Cdw(l/2) = v27T- Plugging this into the left-hand side inequality 
of (2.9) yields (i). Alternatively one can directly apply (2.11). 

The Cauchy score is concave over [1/2, 1), but not monotone (being of bounded 
variation, however, it is the difference of two monotone function). Direct inspec- 
tion of (2.13) nevertheless reveals that 

/C(Jcauchy) = 1/2 and J' Ca uchy( u ) = 27rcos(27r(n - 1/2)), 

hence K+ auchy = Ji auchy (l/2) = 2tt. Substituting this in (2.6) yields (ii). The 
product of the upper bounds in (i) and (ii) yields (iii). □ 

Remarkably, those three bounds are sharp. Indeed, numerical evaluation shows 
that they can be approached arbitrarily well by taking extremely heavy-tails such 
as those of stable densities f a with tail index a — > 0, Student densities with de- 
grees of freedom v — > 0, or Pareto densities with a — > 0; see also the family J^l 
of densities f a ,e(x) defined in equation (3.1). 

Figure 1 provides plots of ARE / (0 w /0 vdW ) and ARE / (0 Cauchy /0 vdW ) for 
various densities. Inspection of those graphs shows that both AREs are decreasing 
as the tails become lighter; the sharpness of bounds (i) and (iii), hence also that of 
bound (ii), is graphically confirmed. 

The bounds proposed in Proposition 2.2 are not new, and have been obtained 
already in [2]. One would like to see similar bounds for other score functions, 
such as the Student ones 

J tv {u) = {v + l) F-\u)/{v + F-\u ) 2 ) 0<«<1 

_1 + ttw-, 1 o J M 1 ~ 2u ) 1/2 < m < 1 (2.14) 



v^7 V IB -( 1 " 2u ) 

10 



where TB v (v) denotes the inverse of the regularized incomplete beta function eval- 
uated at (l,v, v/2, 1/2) and F^ 1 stands for the Student quantile function with v 
degrees of freedom. Note that linxu->._i IB !/ (w) = 0, so that lim u _>i Jt v { u ) = 0- 
Since Jt„(l/2) = and J' tu (l/2) > 0, this means that, on [1/2, 1), J tu is a re- 
descending function; in general, it is neither convex nor concave on [1/2, 1). 
Differentiating (2.14), we get, for u > 1/2, 

W = ^wlln (- 1 + 2IB,(1 - 2u))IB i ,(l - 2u)^, (2.15) 

from which we deduce that 

< v < 1 
n Ji (u) = I -2ir v = 1 

— OO 1 < V 



u-»l tv 



Except for the v — 1 case, which is covered by (ii) and (iii) in Proposition 2.2, 
these values do not provide exploitable values for k + . For v < 1, however, one 
can check from (2.15) that max u >i/ 2 |^'(a ; )| = ^'(1/2), so that 



Jt, 



■*«) r (D/^(T) 



Elementary though somewhat tedious algebra yields 

/C(JO = (i/ + l)/(i/ + 3). 

Plugging this into (2.6), we obtain, for z/ < 1, the following additional bounds. 

Proposition 2.3. For all < u < 1 and all symmetric density f in Tj tv and 
?-h v D -Tvvdw respectively, 

(iv) AREfifaJfa) < 7rr 2 (f )(u + 3)(u + l)/12z/r 2 (^±±), and 

(vj ARE f (<j> t J(f> vdW ) < r 2 (|)( I / + 3)(^ + l)/2z/r 2 (^±i). 

Inequality (iv) is sharp, the bound being achieved, in the limit, under very heavy 
tails (stable densities with a j 0, or Student-t^ densities with \x \. 0). Since 
this is also the case, under the same sequences of densities, for inequality (i) in 
Proposition 2.1, inequality (v) is sharp as well. The upper bounds (iv) and (v) are 
both decreasing functions of the tail index u; both are unbounded at the origin, 
and both converge to the corresponding Cauchy values as v (->• 1. 

11 



2.2. The serial case 

Until the early eighties, and despite some forerunning time-series applications 
such as Wald and Wolfowitz [33] (published as early as 1943 — two years before 
Frank Wilcoxon's pathbreaking 1945 paper [34]!), rank-based methods had been 
essentially limited to statistical models involving univariate independent obser- 
vations. Therefore, the traditional ARE bounds (Hodges and Lehmann [20, 21], 
Chemoff-Savage [1] or Gastwirth [2]), as well as the classical monographs (Hajek 
and Sidak [4], Randies and Wolfe [30], Puri and Sen [29], to quote only a few) 
mainly deal with univariate location and single-output linear (regression) mod- 
els with independent observations. The situation since then has changed, and 
rank-based procedures nowadays have been proposed for a much broader class of 
statistical models, including time series problems, where serial dependencies are 
the main features under study. 

In this section, we focus on the linear rank statistics of the serial type involv- 
ing two square-integrable score functions. Those statistics enjoy optimality prop- 
erties in the context of linear time series (ARMA models; see [16] for details). 
Once adequately standardized, those statistics yield the so-called rank-based au- 
tocorrelation coefficients. Denote by R\ , . . . , Rn the ranks in a triangular ar- 
ray X[ , . . . , Xn of observations. Rank autocorrelations (with lag k) are linear 
serial rank statistics of the form 

.n+l/ Vn + 1 



'Ji J2;fc 



»-*>- , E*(£tW^) -•»&]( 



i«&)' 



t=k+l 



(n) 

where J\ and J 2 are (square-integrable) score-generating functions, whereas rrij j 
and Sj iJ2 := Sj iJrk denote the exact mean of J\ ( -^-^ ) J 2 ( -^tj- ) and the ex- 

act standard error of (n — ^)~ 5 X]™=fc+i^i ( ^+i ) ^ ( ~^Ti ) un der the assumption 

of i.i.d. Xf's (more precisely, exchangeable R^ s), respectively; we refer to 
pages 186 and 187 of [16] for explicit formulas. Signed-rank autocorrelation 
coefficients are defined similarly; see [15] or [16]. 

Rank and signed-rank autocorrelations are measures of serial dependence of- 
fering rank-based alternatives to the usual autocorrelation coefficients, of the form 



r k 



2 , x t x t _ k /y ^xl, 



t=k+l t=l 
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which consitute the Gaussian reference benchmark in this context. Of particular 
interest are 



(i) the van der Waerden autocorrelations [14] 



-vdW;fc 



[n 



kr 1 E &- 1 



R 



in) 



t=k+l 



n 



<r 



R 



(n) 
t-Jfc 



n 



m 



(«) 

vdW 



( 6 vdWj ' 



(ii) the W r a/J-Wb//bwzYz or Spearman autocorrelations [33] 



"SWW:fc 



,n 



,(n) p(n) 



*7 ' 2^ -^i -^t-fc m SWW 



» 



i=fc+l 



rswwj 



(iii) and the Kendall autocorrelations [3] (where explicit values of m£' and s^ 



are provided) 



,(") 

'K;fc 



4L> 



(n) 



(n — k)(n — k — I) 



m 



(») 



(•?')-' 



)W 



with Dl denoting the number of discordances at lag k, that is, the number 



of pairs (J2J B \i£i) 



>(") i?(ft) 



and (i2r\ ^- fe ) that satisf y either 



i?; n) <i?W and i? t W fc > R™ or i? 4 (n) > i?^ and itf^ < R 



(n) . 



)(») . 



(») 



,(n) D (n) 



more specifically, D™ := E?= fc+ i EL + i W < ^, J2™ > « 



s-fc; 



The van der Waerden autocorrelations are optimal — in the sense that they allow 
for locally optimal rank tests in the case of ARMA models with normal innova- 
tion densities. The Spearman and Kendall autocorrelations are serial versions of 
Spearman's rho and Kendall's tau, respectively, and are asymptotically equivalent 
under the null hypothesis of independence; although they are never optimal for 
any ARMA alternative, they achieve excellent overall performance. Signed rank 
autocorrelations are defined in a similar way. 

Let Ji, i = 1, . . . , 4 denote four square- summable score functions, and as- 
sume that they are monotone increasing, or the difference between two mono- 
tone increasing functions (that assumption tacitly will be made in the sequel each 
time AREs are to be computed). Recall that F 2 denotes the subclass of densi- 
ties / E T having finite moments of order two. The asymptotic relative effi- 
ciency, under innovation density / E J~2, of the rank-based tests 4> r j 1 j 2 based on 



the autocorrelations r ^ with respect to the rank-based tests 

~J\ J%\k 



J3J4 



based on the 
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autocorrelations r^ is 

ARE^W^J 

= £(Jfe) f Jj Ji(v)(Pf(F-\v))dv \ 2 KW f Sj J 2 (v)F- 1 (v)dv 
~ Wi) \tiUv)<p f (F- l (v))dv) K.{J 2 ) \^J 4 (v)F- i (v)dv / 

=Wfi {j ^W>*> {j * j * (2 - 16) 

with C f {J 1} h) := K{J 1} f)/K{ J 3 , /) and D f (J 2 , J 4 ) := J(J 2 , f)/J{J it /). 

The Cf ratios have been studied in Section 2.1, and the same conclusions apply 
here; as for the Df ratios, they can be treated by similar methods. 

Denote by fcw, 6™, 0qww, • • • the tests based on r^ , r^ , r^ 

J VvdW' VW VbWW' ~vdW;fc ~W;fc ~SWW;fc 

etc. The serial counterpart of AREj(0 w /0j 1 ) is AREj(0g WW /0} iJ2 ), for which 
the following result holds. 

Proposition 2.4. Let the score functions J\ and J 2 be monotone increasing, skew- 
symmetric about 1/2, and differentiable, with strictly positive J[(l/2) and J 2 (l/2). 
Suppose that / e J 2 f| -^Vi D -^Va * s a symmetric probability density function. 
Then, if J\ and J 2 are 

(i) convex on [1/2, 1), 
(ii) concave on [1/2, 1), 
Proo/ In view of (2.1), we have 



ARE / (0 sww /0 JlJ J = ARE/(0 w /0 Jl ) — - " — — — — 

M-VJ V J J 2 (w)F- 1 (w)dw 



2 



, if 
Consider part (i) of the proposition. It follows from (2.7) that 

x2 



AKEf {fa/fa) < 12X:(J0/(J((l/2)) 2 . 
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Since J 2 is convex over [1/2, 1), J 2 (u) > 4(l/2)(u - 1/2) for all u G [1/2, 1), 
so that 

f J 2 {v)F~\v)&v = 2 [ J 2 {v)F-\v)dv > 4(1/2) / (v - l^F" 1 ^)^. 

io A/2 A/2 

It follows that 

/C(J 2 ) / />-l/2)F->)ck; V 12/C(J 2 ) 
*W V /o^f)^ 1 ^ / " (4(V2)) 2 ' 

where the assumption of finite variance is used. Part (i) of the result follows. 
A similar argument holds (with reversed inequalities) if J 2 is concave, yielding 
part (ii). □ 

Applying this result to the score functions J±(u) = J 2 (u) = $ _1 (m) (convex 
over [1/2,0)) for which 4(1/2) = 4(1/2) = v^ and K{J X ) = JC(J 2 ) = 1, 
we readily obtain the following serial extension of Hodges and Lehmann's "6/7T 
result": 

ARE}(^ ww /^ dW ) = ARE^Maw) < (6/^r) 2 - (2-17) 

An important difference, though, is that the bound in (2.17) is unlikely to 
be sharp. Section 3 provides some numerical evidence of that fact, which is 
hardly surprising: while the ratio C/(J v dw, <Av) is maximized for densities putting 
all their weight about the origin, this no longer holds true for Df( J v dw, <Av)- 
In particular, the sequences of densities considered in [21] or [2] along which 
Cf(J v dw, Jw) tends to its upper bound typically are not the same as those along 
which Df(J vdW , J w ) does. 

3. Some numerical results 

In this final section, we provide numerical values of AR£y(0w/0vdw) (de- 
noted as AREf in the sequel) and ARE*f(<j) r sww / <p r vdW ) (denoted as ARE* f in the 
sequel) under various families of distributions. 

First, let us give some ARE values under Gaussian densities: if / = 0, we 

obtain 

1 
C<h(Jw, Jvdw) = DmJ-Wj <Adw) = n i— ~ 0.28209 



so that 



AREJ(f> w /<f>vdw) = - « 0.95493 

7T 
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e 


Cf 


Df 


ARE f 


ARE} 





.398942 


.282070 


1.90986 


1.82346 


.2 


.396313 


.276619 


1.88476 


1.73062 


.4 


.388772 


.271848 


1.81372 


1.60844 


.6 


.377291 


.271061 


1.70818 


1.50608 


1 


.348213 


.287973 


1.45503 


1.44796 


2 


.294160 


.303085 


1.03836 


1.14461 


3 


.282852 


.285646 


.960064 


.940023 


10 


.282095 


.282095 


.954930 


.911891 


100 


.282095 


.282095 


.954930 


.911891 



Table 1: Numerical values of Cf, D f , ARE f = ARE f (</>w / <j> vdW ) and ARE} = 
Ai?i?!(</)g WW /^ dw ) under densities / 0>e in the Hodges-Lehmann family J-"hl ( see (3.1)), for 
various values of e and a — >• 0. 



and 
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^WswwMdw) = ~ 2 « 0.91189. 
Tables 1-3 provide numerical values of AREf and ARE} under 



(i) (Table 1) the two-parameter family J^l of densities f a>e associated with the 
distribution functions 



Fa*(x) 



$(x) if < x < e 

$(e + a(x -e)) if e < x 



(3.1) 



where F a>e (x) is defined by symmetry for x < (this family of distribu- 
tions, which has been used by Hodges and Lehmann [21], is such that the 
nonserial 6/ir bound is achieved, in the limit, as both a and e go to zero), 

(ii) (Table 2) the family J-"s tu dcnt of Student densities with degrees of freedom 
v > 0, and 

(iii) (Table 3) the family T e of power-exponential densities, of the form 

P -\A a 



fa(x) 



2T(1 + 1/a) 



i6R, a > 0. 



(3.2) 



All tables seem to confirm the same findings : both the serial and the non-serial 
AREs are monotone in the size of the tails, with the non- serial AREj attaining its 
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V 


c f 


Df 


ARE f 


ARE} 


0.1 


.394451 


- 


1.86710 


- 


1 


.343120 


- 


1.41277 


- 


2 


.321212 


.243196 


1.23813 


.878736 


4 


.304695 


.269173 


1.11407 


.968623 


6 


.297953 


.274541 


1.06531 


.963551 


8 


.294303 


.276784 


1.03937 


.955507 


10 


.292017 


.278005 


1.02329 


.949042 


100 


.283146 


.281737 


.962059 


.916370 



Table 2: Numerical values of C f , D f , AREf 



ARE f ((j>vj/<l) v dw) and ARE} 



ARE}((f>g WW /(f> 7 vdw ) under Student-t densities with various degrees of freedom v. 



a 


Cf 


Df 


ARE f 


ARE} 


0.1 


.393903 


M '5222 


1.86191 


0.685991 


1 


.313329 


.2720600 


1.1781 


1.046388 


2 


.282095 


.2820950 


.954930 


.911893 


10 


.222095 


.2934363 


.591916 


.611600 


100 


.168549 


.2953577 


.340904 


.356871 



Table 3: 

ARE* f {4, 
ters a. 



Numerical values of C 

sww/'A 



/, Df, AREf = AREf((/) W /(j) vdW ) and ARE} = 
vdw) un der power exponential densities for various values of the shape parame- 
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non serial ARE 



serial ARE 





Figure 2: Nonserial AREj = AREf((p^/(j) vdW ) (left plot) and serial ARE} = 
j4i?£^t((/)g WW /0y dW ) (right plot) under densities f a>e in the Hodges-Lehmann family 
^hl ( see (3.1)), as a function of e G [0, 4]], for various choices of the parameter a. 
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Student distribution 



Power exponential distribution 





W/ 



Vdw) and ARE 



U v^SWW 



/^vdw) fOT f" the Student 



Figure 3: Left plot : ARE/,, 

distribution, as a function of the degrees of freedom v G [2,6]. Right plot : AREj a and 
ARE^ for the power exponential densities f a (3.2), as a function of the shape parameter 
a£ [0,11]. 
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maximal value (6/-7T « 1.90986) under heavy-tailed / densities, while the max- 
imal value for the serial AREJ. lies somewhere around (6/7r)(3/7r) ~ 1.82346. 
Inspection of Table 1 reveals that, although the limit of Cf as a — > is monotone 
in the parameter e, the ratio Df is not; from Table 3, the highest values of Df 
under the distribution (3.1) are attained for a — > oo and e w 0. 

Under Student densities / = / tl/ , the nonserial AREj is decreasing with v, tak- 
ing value 1.41277 at the Cauchy (y = 1), value one about v = 15.42 (a value of v 
that is not shown in the figure; Wilcoxon is thus outperforming van der Waerden 
up to v = 15 degrees of freedom, with van der Waerden taking over from v — 16 
on), and tending to the Gaussian value 0.95493 as v — > oo; the serial AREj is un- 
defined for v < 2, increasing for small values of v, from an infimum of 0.878736 
(obtained as v \ 2) up to a maximum of 0.968852 (reached about v = 4.24), then 
slowly decreasing to the Gaussian value 0.911891 as v — > oo. Sperman-Wald- 
Wolfowitz and Kendall thus never outperform van der Waerden autocorrelations 
under Student densities. 

Under the double exponential densities f = f a , the nonserial AREj is de- 
creasing with a, with a supremum of 6/n (the Hodges-Lehmann bound, obtained 
as a J, 0), and reaches value one about a = 1.7206 (similar local asymptotic per- 
formances of Wilcoxon and van der Waerden, thus, occur at power-exponentials 
with parameter a = 1.7206); the serial AREJ. is quite bad as a | 0, then rapidly 
increasing for small values of a, with a maximum of 1.08552 about a = 0.510, 
then deteriorating again as a — > oo; for a larger than 3, the serial and nonserial 
AREs roughly coincide. 
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